SINGLE-CHAIN BIFUNCTIONAL GLYCOPROTEIN HORMONES



Acknowledgment of Government Support 

This invention was made in part with government support under NIH Contract
No. N01-HD-9-2922, awarded by the National Institutes of Health. The government has
certain rights in this invention.



Technical Field 

The invention relates to the field of protein engineering, specifically to modified
forms of certain glycoprotein hormones which occur normally as heterodimers. The
invention concerns modified single-chain forms of chorionic gonadotropin (CG), thyroid
stimulating hormone (TSH), luteinizing hormone (LH), and follicle stimulating hormone
(FSH) that can provide two effects or functions, or can behave generally as agonists
and/or antagonists of the native hormones.

Background Art 

In humans, four important glycoprotein hormone heterodimers (LH, FSH, TSH
and CG) have identical α subunits and differing β subunits. Three of these hormones are
present in virtually all other vertebrate species as well; CG has so far been found only in
primates and in the placenta and urine of pregnant mares.

PCT application WO90/09800, published 7 September 1990, and incorporated
herein by reference, describes a number of modified forms of these hormones. One
important modification is C-terminal extension of the β subunit by the carboxy terminal
peptide (CTP) of human chorionic gonadotropin or a variant thereof. Other muteins of
these hormones are also described. CTP is the sequence of amino acids extending from
any one of positions 112-118 to position 145 of the β subunit of human chorionic
gonadotropin. The PCT application describes variants of the CTP extension obtained by
conservative amino acid substitutions such that the capacity of the CTP to alter the
clearance characteristics is not destroyed. In addition, PCT application WO94/24148,
published 27 October 1994, incorporated herein by reference, describes modifying these
hormones by extension or insertion of the CTP at locations other than the C-terminus and
CTP fragments shorter than the sequence extending from positions 112-118 to 145.

The CTP-extended β subunit of FSH is also described in two papers by applicants
herein: LaPolt, P.S. et al. Endocrinology (1992) 131:2514-2520 and Fares, F.A. et al.
Proc Natl Acad Sci USA (1992) 89:4304-4308. Both of these papers are incorporated
herein by reference.

The crystal structure of the heterodimeric form of human chorionic gonadotropin
has now been published in more or less contemporaneous articles: one by Lapthorn, A.J.
et al. Nature (1994) 369:455-461 and the other by Wu, H. et al. Structure (1994)
2:545-558. The results of these articles are summarized by Patel, D.J. Nature (1994)
369:438-439.

PCT application WO91/16922 published 14 November 1991 describes a
multiplicity of chimeric and otherwise modified forms of the heterodimeric glycoprotein
hormones. In general, the disclosure is focused on chimeras of α subunits or β subunits
involving portions of various α or β chains respectively. One construct simply listed in
this application, and not otherwise described, fuses substantially all of the β chain of
human chorionic gonadotropin to the α subunit preprotein, i.e., including the secretory
signal sequence for this subunit.

Two additional published PCT applications describe single-chain forms of these
hormones wherein the α and β subunits are covalently linked to result in a fusion peptide
of the general formula:

β-(linker)ₙ-α or

α-(linker)ₙ-β

wherein n is 0 or 1 and α and β represent the respective subunits of these
hormones: Moyle, W.R., PCT application WO95/22340 published 24 August 1995 and
the application of the inventor herein, WO96/05224 published 22 February 1996. The
disclosure of these documents is also incorporated herein by reference.

Forms of the above-described single-chain glycoprotein hormones in which the
number of cystine bridges has been depleted are disclosed in US Serial No. 08/933,693
filed 19 September 1997, and incorporated herein by reference.

It has now been found possible to construct single-chain forms of the glycoprotein
hormones which have enhanced agonist and/or antagonist activity and/or which are
bifunctional by including two β subunits in a single chain so that they share a common α
subunit. These forms may contain various CTP extensions and insertions as well as
variants of the native forms of the α and β subunits and of CTP as described in the
documents set forth above.

Disclosure of the Invention 

The invention provides single-chain forms of the glycoprotein hormones that
contain two β subunits that may be the same or different. The single-chain forms of the
invention may be glycosylated, partially glycosylated, or nonglycosylated, and the α
and β chains that occur in the native glycoprotein hormones or variants of them may
optionally be linked through a linker moiety. Particularly preferred linker moieties
include the carboxy terminal peptide (CTP) unit either as a complete unit or a variant
including variants which represent only a portion thereof. The resulting single-chain
hormones either retain or enhance the activity of the unmodified heterodimeric forms or
are antagonists of this activity. If the two β subunits are different, they are bifunctional as
agonists and/or antagonists.

Thus, in one aspect, the invention is directed to a glycosylated or nonglycosylated
protein of the formula

β¹-(linker¹)ₘ-α-(linker²)ₙ-β² (1); or

β¹-(linker¹)ₘ-β²-(linker²)ₙ-α (2); or

α-(linker¹)ₘ-β¹-(linker²)ₙ-β² (3)

wherein each of β¹ and β² has the amino acid sequence of the β subunit of a
vertebrate glycoprotein hormone or a variant of said amino acid sequence, wherein said
variants are defined herein; "α" designates the α subunit of a vertebrate glycoprotein
hormone or a variant thereof; and "linker" refers to a covalently linked moiety that spaces
the β¹ and β² subunits at appropriate distances from the α subunit and from each other.
Each of m and n is independently 0 or 1.
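
By way of illustration only, the arrangements encompassed by formulas (1), (2) and (3)
can be enumerated with a short Python sketch. The labels "beta1", "beta2", "alpha",
"linker1" and "linker2" are hypothetical placeholders introduced for this example and
are not part of the disclosure; m and n each take the value 0 (linker absent) or 1
(linker present).

```python
from itertools import product

# Subunit order for each formula; labels are hypothetical placeholders.
SUBUNIT_ORDERS = {
    1: ("beta1", "alpha", "beta2"),   # formula (1)
    2: ("beta1", "beta2", "alpha"),   # formula (2)
    3: ("alpha", "beta1", "beta2"),   # formula (3)
}


def enumerate_layouts():
    """Return (formula, m, n, layout) for every combination of m, n in {0, 1}."""
    layouts = []
    for formula, (first, second, third) in SUBUNIT_ORDERS.items():
        for m, n in product((0, 1), repeat=2):
            parts = [first]
            if m:
                parts.append("linker1")   # linker between first and second unit
            parts.append(second)
            if n:
                parts.append("linker2")   # linker between second and third unit
            parts.append(third)
            layouts.append((formula, m, n, "-".join(parts)))
    return layouts


if __name__ == "__main__":
    for formula, m, n, layout in enumerate_layouts():
        print(f"formula ({formula}), m={m}, n={n}: {layout}")
```

Three subunit orders, each with two independently optional linkers, give twelve
arrangements in this simplified enumeration; the choice of particular β subunits,
variants and linkers multiplies that number further.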

In all of the foregoing cases, the single-chain form preserves conformation so that
inclusion of the entire subunits in the single-chain forms is unnecessary. Thus, the
invention includes compounds of formulas (1), (2) and (3) that comprise fragments of the
α and/or β subunits wherein these forms retain the biological activity exhibited by the
corresponding forms which contain the complete subunits.

In other aspects, the invention is directed to recombinant materials and methods to
produce the proteins of the invention; to pharmaceutical compositions containing them; to
antibodies specific for them; and to methods for their use.

Brief Description of the Drawings 

Figure 1 shows the binding of the compound CGβ-α-CTP-FSHβ to the LH
receptor in competition with hCG.

Figure 2 shows the binding of the compound shown in Figure 1 to the FSH
receptor in competition with FSH.

Modes of Carrying Out the Invention 

Four "glycoprotein" hormones in humans provide a family which includes human
chorionic gonadotropin (hCG), follicle stimulating hormone (FSH), luteinizing hormone
(LH), and thyroid stimulating hormone (TSH). As used herein, "glycoprotein hormones"
refers to all the members of this family. All of these hormones are heterodimers
composed of α subunits which, for a given species, are identical in amino acid sequence
among the group, and β subunits which differ according to the member of the family.
Thus, normally these glycoprotein hormones occur as heterodimers composed of α and β
subunits that are associated but not covalently linked. Most vertebrates produce FSH,
TSH and LH; chorionic gonadotropin has been found only in primates, including humans,
and in pregnant mares.

In animals, the α and β subunits of each hormone are encoded in different genes
and are synthesized separately and then assembled into the noncovalent heterodimeric
complex. In the compounds of the invention the β subunits are directly linked to an α
subunit into a single-chain molecule which is essentially linear in primary structure. The
three dimensional structure conferred by secondary and tertiary structural considerations
and conformation is apparently sufficiently similar to the heterodimeric form to permit
the functionality of the heterodimer represented by the β subunits to be exhibited.
However, by suitable variation of the structures of the subunits, the compounds of the
invention may have agonist or antagonist activity; for example, if the β subunits are
different, the compounds may exhibit antagonist activity with respect to a receptor for
one of the glycoprotein hormones but agonist activity for the receptor of another, or may
have agonist or antagonist activity for both. The spectrum of the activities exhibited by
the compounds of the invention will be dependent on the selection of the individual α and
β subunits as well as the nature of the linker moieties and the orientation of the α and β
subunits.

In the most preferred embodiment of the invention, the compounds of formulas
(1), (2) or (3) are fusion proteins wherein the α and β subunits are linked head-to-tail
either directly or through peptide linkers. Where only gene-encoded amino acids
comprise the sequence, the compound can be synthesized recombinantly. However, it is
unnecessary to restrict the compounds of the invention in this manner; the α and β
subunits as well as the linkers may include amino acids that are not gene encoded. In
addition, the linkers may be other than peptide, such as dicarboxylic acids or anhydrides,
diamines, or bifunctional linkers such as those sold by Pierce Chemical Co., Rockford, IL
and the like. In addition, the subunits may be linked either directly or through a linker in
a head-to-head or tail-to-tail configuration as well as a head-to-tail configuration as would
be required in a fusion protein. Under these circumstances, for a head-to-head
configuration, two amino groups may be linked through an anhydride or through any
dicarboxylic acid derivative; two carboxyl groups can be linked through diamines or diols
using standard activation techniques.

However, for convenience the most preferred form is a head-to-tail configuration 
wherein standard peptide linkages suffice and the compound can be prepared as a fusion 
protein recombinantly or using synthetic peptide techniques either in a single sequence of 
reactions or, preferably, ligating individual portions of the entire sequence. 

Whatever the embodiment, the α and β subunits are joined to the remainder of the
molecule at positions proximal to their N and C termini. It is preferred that these subunits
be linked directly at their termini; however, this linkage may simply be "proximal." In
general, "proximal" indicates a position which is within 10 amino acids, preferably
within five amino acids, more preferably within two amino acids of the terminus, and
most preferably at the terminus per se.
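
The graded preference just stated can be sketched as a small Python helper. This is an
illustration only: the function name and the convention that distance is counted as the
number of residues separating a 1-based position from the nearer terminal residue are
assumptions made for the example, not definitions taken from the text.

```python
def proximity_tier(position, length):
    """Classify a 1-based residue position by its distance to the nearer terminus.

    Distance 0 means the terminal residue itself. The counting convention is an
    assumption made for this sketch.
    """
    if not 1 <= position <= length:
        raise ValueError("position must lie within the sequence")
    distance = min(position - 1, length - position)
    if distance == 0:
        return "at terminus (most preferred)"
    if distance <= 2:
        return "within 2 (more preferred)"
    if distance <= 5:
        return "within 5 (preferred)"
    if distance <= 10:
        return "within 10 (proximal)"
    return "not proximal"


if __name__ == "__main__":
    # 92 is used as the sequence length, matching the alpha subunit positions
    # (85-92 at the C-terminus) mentioned elsewhere in this description.
    for pos in (1, 3, 82, 46):
        print(pos, proximity_tier(pos, 92))
```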

The Subunit Components 

As used herein, the common α subunit, and FSH, LH, TSH, and CG β subunits as
well as the heterodimeric forms have their conventional definitions and refer to the
proteins having the amino acid sequences known in the art per se, or allelic variants
thereof, regardless of the glycosylation pattern exhibited or other derivatization of the
amino acid side chains.

"Native" forms of these peptides are those which have the amino acid sequences
that have been isolated from the relevant vertebrate tissue, and have these known
sequences per se, or those of their allelic variants.

"Variant" forms of these proteins and of CTP units (see below) are those which
have deliberate alterations, including truncations, in amino acid sequences of the native
protein produced by, for example, site-specific mutagenesis or by other recombinant
manipulations, or which are prepared synthetically.

These alterations consist of 1-10, preferably 1-8, and more preferably 1-5 amino
acid changes, including deletions, insertions, and substitutions, most preferably
conservative amino acid substitutions. The resulting variants must retain an activity
which affects the corresponding activity of the native hormone - i.e., either they must
retain the biological activity of the native hormone so as to behave as agonists, or they
must behave as antagonists, generally by virtue of being able to bind the receptors for the
native hormones but lacking the ability to effect signal transduction.

"Conservative analog" means, in the conventional sense, an analog wherein the
residue substituted is of the same general amino acid category as that for which
substitution is made. Amino acids have been classified into such groups, as is understood
in the art, by, for example, Dayhoff, M. et al., Atlas of Protein Sequences and Structure
(1972) 5:89-99. In general, acidic amino acids fall into one group; basic amino acids into
another; neutral hydrophilic amino acids into another; and so forth. More specific
classifications are set forth in WO 96/05224 incorporated by reference above.
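
As an illustration only, the numerical limits on amino acid changes and the notion of a
conservative substitution can be sketched in Python. The grouping used here is a coarse,
hypothetical one chosen for the example; it is not the classification of Dayhoff or of
WO 96/05224, and residues left ungrouped (C, G, P) are simply treated as having no
conservative replacement. Only substitutions between equal-length sequences are handled;
insertions and deletions would require an alignment step that is not shown.

```python
# Hypothetical, coarse residue groups for illustration only.
RESIDUE_GROUPS = {
    "acidic": set("DE"),
    "basic": set("KRH"),
    "neutral_hydrophilic": set("STNQ"),
    "hydrophobic": set("AVLIMFWY"),
}


def is_conservative(native, replacement):
    """True when both residues fall in the same illustrative group."""
    return any(native in group and replacement in group
               for group in RESIDUE_GROUPS.values())


def summarize_substitutions(native_seq, variant_seq):
    """Count substitutions between two equal-length sequences and grade them."""
    if len(native_seq) != len(variant_seq):
        raise ValueError("this sketch only compares equal-length sequences")
    changes = [(i + 1, a, b)
               for i, (a, b) in enumerate(zip(native_seq, variant_seq))
               if a != b]
    count = len(changes)
    return {
        "changes": count,
        "within_1_to_10": 1 <= count <= 10,
        "within_1_to_8": 1 <= count <= 8,
        "within_1_to_5": 1 <= count <= 5,
        "all_conservative": all(is_conservative(a, b) for _, a, b in changes),
    }


if __name__ == "__main__":
    # Toy sequences, not taken from any hormone subunit.
    print(summarize_substitutions("ADEK", "AEEK"))
```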

One set of preferred variants is that wherein the glycosylation sites of either the α
or β subunits or both have been altered. Some useful variants of the hormone quartet
described herein are set forth in U.S. Patent 5,177,193 issued 5 January 1993 and
incorporated herein by reference. As shown therein, the glycosylation patterns can be
altered by destroying the relevant sites or, in the alternative, by choice of host cell in
which the protein is produced.

Alterations in amino acid sequence also include both insertions and deletions.
Thus, truncated forms of the hormones are included among variants, e.g., mutants of the
α subunit which are lacking some or all of the amino acids at positions 85-92 at the
C-terminus. In addition, α subunits with 1-10 amino acids deleted from the N-terminus
are included.

Variants also include those with noncritical regions altered or removed. Such
deletions and alterations may comprise entire loops, so that sequences of considerably
more than 10 amino acids may be deleted or changed. The resulting variants must,
however, retain at least the receptor binding domains and/or the regions involved in
signal transduction.

There is considerable literature on variants of the glycoprotein hormones and it is
clear that a large number of possible variants which result both in agonist and antagonist
activity can be prepared. Such variants are disclosed, for example, in Chen, F. et al.
Molec Endocrinol (1992) 6:914-919; Yoo, J. et al. J Biol Chem (1993) 268:13034-13042;
Yoo, J. et al. J Biol Chem (1991) 266:17741-17743; Puett, D. et al. Glycoprotein
Hormones, Lustbader, J.W. et al. eds., Springer Verlag New York (1994) 122-134;
Keutmann, H.T. et al. (ibid) pages 103-117; Erickson, L.D. et al. Endocrinology (1990)
126:2555-2560; and Bielinska, M. et al. J Cell Biol (1990) 111:330a (Abstract 1844).

Other variants include those wherein one or more cystine bonds are deleted,
typically by substituting a neutral amino acid for one or both cysteines which participate
in the link. Particularly preferred cystine bonds which may be deleted are those between
positions 26 and 110 and between positions 23 and 72.

In addition, it has been demonstrated that the β subunits of the hormone quartet
can be constructed in chimeric forms so as to provide biological functions of both
components of the chimera, or, in general, hormones of altered biological function. Thus,
chimeric molecules which exhibit both FSH and LH/CG activities can be constructed as
described by Moyle, Proc Natl Acad Sci (1991) 88:760-764; Moyle, Nature (1994)
368:251-255. As disclosed in these papers, substituting amino acids 101-109 of FSH-β
for the corresponding residues in the CG-β subunit yields an analog with both hCG and
FSH activity.

As used herein, "peptide" and "protein" are used interchangeably, since the length
distinction between them is arbitrary.

As stated above, the "variants" employed as α and β subunits in forming
compounds of the invention with or without linking moieties may represent the complete
amino acid sequences of the subunits or only portions thereof.

"Variants" also include α and/or β chains which contain a CTP (or a variant of
CTP) inserted into a noncritical region.

"Noncritical" regions of the α and β subunits are those regions of the molecules
not required for biological activity (including agonist and antagonist activity). In general,
these regions are removed from binding sites, precursor cleavage sites, and catalytic
regions. Regions critical for inducing proper folding, binding to receptors, catalytic
activity and the like should be evaluated. It should be noted that some of the regions
which are critical in the case of the dimer become noncritical in single-chain forms since
the conformational restriction imposed by the molecule may obviate the necessity for
these regions. The ascertainment of noncritical regions is readily accomplished by
deleting or modifying candidate regions and conducting an appropriate assay for the
desired activity. Regions where modifications result in loss of activity are critical;
regions wherein the alteration results in the same or similar activity (including antagonist
activity) are considered noncritical.
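
The screening logic described above can be sketched in Python as follows. This is an
illustration under stated assumptions: the function name, the region labels, the activity
values and the threshold for "same or similar" activity (here, retention of at least half
of the reference activity) are all hypothetical and would in practice be set by the assay
actually used.

```python
def classify_regions(reference_activity, modified_activities, min_retained=0.5):
    """Label each candidate region as 'critical' or 'noncritical'.

    reference_activity: assay readout for the unmodified single-chain form.
    modified_activities: mapping of region label -> readout after deleting or
        modifying that region.
    min_retained: hypothetical fraction of reference activity that counts as
        "same or similar" activity.
    """
    if reference_activity <= 0:
        raise ValueError("reference activity must be positive")
    labels = {}
    for region, activity in modified_activities.items():
        retained = activity / reference_activity
        labels[region] = "noncritical" if retained >= min_retained else "critical"
    return labels


if __name__ == "__main__":
    # Placeholder numbers for demonstration; not experimental data.
    print(classify_regions(100.0, {"region_A": 95.0, "region_B": 5.0}))
```

The same comparison applies whether the readout measures agonist or antagonist
activity, consistent with the definition of biological activity used herein.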

It should again be emphasized that by "biological activity" is meant activity
which is either agonistic or antagonistic to that of the native hormones. Thus, certain
regions are critical for behavior of a variant as an antagonist, even though the antagonist
is unable to directly provide the physiological effect of the hormone.

For example, for the α subunit, positions 33-59 are thought to be necessary for
signal transduction and the 20 amino acid stretch at the carboxy terminus is needed for
signal transduction/receptor binding. Residues critical for assembly with the β subunit
include at least residues 33-58, particularly 37-40.

Where the noncritical region is "proximal" to the N- or C-terminus, the insertion
is at any location within 10 amino acids of the terminus, preferably within 5 amino acids,
and most preferably at the terminus per se.

As used herein, the "CTP unit" refers to an amino acid sequence found at the
carboxy terminus of human chorionic gonadotropin β subunit which extends from amino
acid 112-118 to residue 145 at the C-terminus or to a portion thereof. Thus, each
"complete" CTP unit contains 28-34 amino acids, depending on the N-terminus of the
CTP.
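
The stated range of 28-34 amino acids follows directly from the inclusive position
numbering, as the following short Python sketch shows. The constant and function names
are introduced for this illustration only.

```python
CTP_LAST_POSITION = 145              # C-terminal residue of the hCG beta subunit
CTP_FIRST_POSITIONS = range(112, 119)  # permitted N-terminal positions, 112 to 118


def complete_ctp_lengths():
    """Map each permitted starting position to the length of the complete CTP unit."""
    return {start: CTP_LAST_POSITION - start + 1 for start in CTP_FIRST_POSITIONS}


if __name__ == "__main__":
    for start, length in complete_ctp_lengths().items():
        print(f"positions {start}-{CTP_LAST_POSITION}: {length} amino acids")
```

A unit beginning at position 112 therefore has 34 residues, and one beginning at
position 118 has 28.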

By a "partial" CTP unit is meant an amino acid sequence which occurs between
positions 112-118 to 145 inclusive, but which has at least one amino acid deleted from
the shortest possible "complete" CTP unit (i.e., from positions 118-145). These "partial"
sequences are included in the definition of "variants." The "partial" CTP units preferably
contain at least one O-glycosylation site. Some nonglycosylated forms of the hormones
are antagonists and are useful as such. The CTP unit contains four glycosylation sites at
the serine residues at positions 121 (site 1); 127 (site 2); 132 (site 3); and 138 (site 4).
The partial forms of CTP useful in agonists will contain one or more of these sites
arranged in the order in which they appear in the native CTP sequence, although
intervening sites may be omitted.
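
For illustration, the glycosylation sites retained by a given CTP fragment can be worked
out from the positions listed above. The Python sketch below assumes, for simplicity,
that the fragment is a single contiguous stretch of the native numbering; partial units
formed by internal deletions are not modeled, and the function names are hypothetical.

```python
# Serine O-glycosylation sites of the CTP unit, keyed by site number.
GLYCOSYLATION_SITES = {1: 121, 2: 127, 3: 132, 4: 138}


def sites_in_fragment(first, last):
    """Return the site numbers, in native order, contained in positions first..last."""
    if not 112 <= first <= last <= 145:
        raise ValueError("fragment must lie within positions 112-145")
    return [site for site, position in GLYCOSYLATION_SITES.items()
            if first <= position <= last]


def is_partial(first, last):
    """True when the contiguous fragment lacks any residue of positions 118-145."""
    if not 112 <= first <= last <= 145:
        raise ValueError("fragment must lie within positions 112-145")
    return not (first <= 118 and last == 145)


if __name__ == "__main__":
    for first, last in ((118, 145), (125, 135), (139, 145)):
        print(first, last, sites_in_fragment(first, last), is_partial(first, last))
```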

In some cases, CTP units may be inserted or used as linkers in tandem. By 
"tandem" inserts or extensions is meant that the insert or extension contains at least two 
"CTP units." Each CTP unit may be complete or a fragment, and native or a variant. All 
of the CTP units in the tandem extension or insert may be identical, or they may be 

different from each other.

The "linker moiety" is a moiety that joins the α and β sequences without
interfering with the activity that would otherwise be exhibited by the same α and β chains
as members of a heterodimer, or which alters that activity to convert it from agonist to
antagonist activity. The level of activity may change within a reasonable range, but the
presence of the linker cannot be such so as to deprive the single-chain form of both
substantial agonist and substantial antagonist activity. The single-chain form does not
represent a propeptide but the mature protein and must exhibit activity pertinent to the
hormonal activity of the heterodimer, the elements of which form its components.

Preferred Embodiments of the Bifunctional Hormones

The bifunctional hormones of the invention are most efficiently and economically
produced using recombinant techniques. Therefore, fusion proteins comprising those
forms of α and β chains, CTP units and other linker moieties which include only
gene-encoded amino acids are preferred. It is possible, however, as set forth above, to
construct at least portions of the single-chain hormones using synthetic peptide
techniques or other organic synthesis techniques and therefore variants which contain
nongene-encoded amino acids and nonpeptide based linkers are also within the scope of
the invention.

In the most preferred embodiment, the C-terminus of the β¹ subunit is covalently
linked, optionally through a linker, to the N-terminus of the mature α subunit which is in
turn covalently linked, optionally through a linker, to the β² subunit. The linkage can be
a direct peptide linkage wherein the C-terminal amino acid of one subunit is directly
linked through the peptide bond to the N-terminus of the other; however, in many
instances it is preferable to include a linker moiety between the two termini. In many
instances, the linker moiety will provide at least one β turn between the two chains. The
presence of proline residues in the linker may therefore be advantageous.

(It should be understood that in discussing linkages between the termini of the
subunits comprising the single-chain forms, one or more termini may be altered by
substitution and/or deletion as described above.)

In one particularly preferred set of embodiments, the linkage is head-to-tail and 
the linker moiety will include one or more CTP units and/or variants or truncated forms 
thereof. Preferred forms of the CTP units used in such linker moieties are described 
hereinbelow.

Further, the linker moiety may include a drug covalently, preferably releasably,
bound to the linker moiety. Means for coupling the drug to the linker moiety and for
providing for its release are conventional.

In addition to their occurrence in the linker moiety, CTP and its variants may also
be included in any noncritical region of the subunits making up the single-chain hormone
as described above.

While CTP units are preferred inclusions in the linker moiety, it is understood that
the linker may be any suitable covalently bound material which provides the appropriate
spatial relationship between the α and β subunits. Thus, for head-to-tail configurations
the linker may generally be a bivalent moiety such as a peptide comprising an arbitrary
number, but typically less than 100, more preferably less than 50 amino acids which has
the proper hydrophilicity/hydrophobicity ratio to provide the appropriate spacing and
conformation in solution or a nonpeptide linker which confers these characteristics. In
general, the linker should be on balance hydrophilic so as to reside in the surrounding
solution and out of the way of the interaction between the α and β subunits or the two β
subunits. It is preferable that the linker include β turns typically provided by proline
residues in peptide linkers, or comprise serine and/or glycine residues. Any suitable
polymer, including peptide linkers, with the above-described correct characteristics may
be used.
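
A candidate peptide linker can be screened against these general criteria with a short
Python sketch. This is an illustration only. The use of the Kyte-Doolittle hydropathy
scale, with a negative mean value taken to indicate a linker that is hydrophilic on
balance, is an assumption introduced here and is not a measure specified in this
description; the function name and the example sequences are likewise hypothetical.

```python
# Kyte-Doolittle hydropathy values; more negative means more hydrophilic.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def screen_linker(sequence):
    """Summarize length, mean hydropathy and Pro/Ser/Gly content of a peptide linker."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("empty sequence")
    unknown = set(sequence) - set(KYTE_DOOLITTLE)
    if unknown:
        raise ValueError(f"unrecognized residues: {sorted(unknown)}")
    mean_hydropathy = sum(KYTE_DOOLITTLE[r] for r in sequence) / len(sequence)
    return {
        "length": len(sequence),
        "under_100": len(sequence) < 100,   # typical upper limit
        "under_50": len(sequence) < 50,     # more preferred upper limit
        "mean_hydropathy": mean_hydropathy,
        "hydrophilic_on_balance": mean_hydropathy < 0,  # assumed criterion
        "has_pro_ser_or_gly": any(r in "PSG" for r in sequence),
    }


if __name__ == "__main__":
    # Toy sequences chosen only to exercise the function.
    print(screen_linker("GSGSGS"))
    print(screen_linker("LLLL"))
```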

Particularly preferred embodiments of the bifunctional hormones of the invention
include in head-to-tail configuration:

βFSH-α-βFSH; α-βFSH-βLH; βFSH-α-βLH;
βLH-α-βLH; α-βLH-βFSH; βLH-α-βFSH;
βTSH-α-βTSH; βTSH-βFSH-α; βTSH-α-βFSH;
βCG-α-βCG; α-βCG-βFSH; α-βCG-βTSH; βCG-βFSH-α; βCG-α-βTSH;
βFSH-CTP-α-βFSH; α-βFSH-CTP-βLH; βFSH-CTP-α-βLH;
βLH-CTP-α-βLH; α-βLH-CTP-βFSH; βLH-α-CTP-βFSH;
βLH(δ115-123)-α-βFSH; βLH(δ115-123)-CTP-α-βFSH;
βCG-CTP-α-CTP-βFSH-CTP-CTP;
βTSH-CTP-CTP-α-βFSH-CTP-CTP;
βFSH-CTP-CTP-α-βLH; βLH-CTP-CTP-βLH-α;
βCG-CTP-CTP-α-βTSH; βCG-CTP-CTP-βLH-α;
βFSH-CTP-βLH(δ115-123)-CTP-α;

and the like. Also particularly preferred are the human forms of the subunits. In the
above constructions, "CTP" refers to CTP or its variants including truncations as
described in WO 96/05224.

While for human use, the human forms of the α and β subunits are desirable, it
should be noted that the corresponding forms in other vertebrates are useful in veterinary
contexts. Thus, the FSH, TSH and LH subunits characteristic of bovine, ovine, equine,
porcine, feline, canine, and other species are appropriate to indications affecting these
species per se.

Suitable drugs that may be included in the linker moiety include peptides or
proteins such as insulin-like growth factors; epidermal growth factors; acidic and basic
fibroblast growth factors; platelet-derived growth factors; the various colony stimulating
factors, such as granulocyte CSF, macrophage-CSF, and the like; as well as the various
cytokines such as IL-2, IL-3 and the plethora of additional interleukin proteins; the
various interferons; tumor necrosis factor; and the like. Suitable cleavage sites for the
release of these drugs may be included, such as target sequences for proteases whose
target sites are not present in the α and β subunits. Peptide- or protein-based drugs have
the advantage that the entire construct can readily be produced by recombinant expression
of a single gene. Also, small molecule drugs such as antibiotics, antiinflammatories,
toxins, and the like can be used.

In general, the drugs included within the linker moiety will be those desired to act
in the proximity of the receptors to which the hormones ordinarily bind. Suitable
provision for release of the drug from inclusion within the linker will be provided, for
example, by also including sites for enzyme-catalyzed lysis as further described under the
section headed Preparation Methods hereinbelow.

Other Modifications

The single-chain proteins of the invention may be further conjugated or
derivatized in ways generally understood to derivatize amino acid sequences, such as
phosphorylation, glycosylation, deglycosylation of ordinarily glycosylated forms,
acylation, modification of the amino acid side chains (e.g., conversion of proline to
hydroxyproline) and similar modifications analogous to those posttranslational events
which have been found to occur generally.

The glycosylation status of the hormones of the invention is particularly
important. The hormones may be prepared in nonglycosylated form either by producing
them in procaryotic hosts or by mutating the glycosylation sites normally present in the
subunits and/or any CTP units that may be present. Both nonglycosylated versions and
partially glycosylated versions of the hormones can be prepared by manipulating the
glycosylation sites. Normally glycosylated versions are, of course, also included within
the scope of the invention.

As is generally known in the art, the single-chain proteins of the invention may
also be coupled to labels, carriers, solid supports, and the like, depending on the desired
application. The labeled forms may be used to track their metabolic fate; suitable labels
for this purpose include, especially, radioisotope labels such as iodine 131, technetium
99, indium 111, and the like. The labels may also be used to mediate detection of the
single-chain proteins in assay systems; in this instance, radioisotopes may also be used as
well as enzyme labels, fluorescent labels, chromogenic labels, and the like. The use of
such labels permits localization of the relevant receptors since they can be used as
targeting agents for such receptors.

The proteins of the invention may also be coupled to carriers to enhance their
immunogenicity in the preparation of antibodies specifically immunoreactive with these
new modified forms. Suitable carriers for this purpose include keyhole limpet
hemocyanin (KLH), bovine serum albumin (BSA) and diphtheria toxoid, and the like.
Standard coupling techniques for linking the modified peptides of the invention to
carriers, including the use of bifunctional linkers, can be employed.

Similar linking techniques, along with others, may be employed to couple the
proteins of the invention to solid supports. When coupled, these proteins can then be
used as affinity reagents for the separation of desired components with which specific
reaction is exhibited. Thus, they are useful in the purification and isolation of the
receptors with which the appropriate β subunit interacts.



Preparation Methods 

Methods to construct the proteins of the invention are well known in the art. As
set forth above, if only gene encoded amino acids are included, and the single-chain is in
a head-to-tail configuration, the most practical approach at present is to synthesize these
materials recombinantly by expression of the DNA encoding the desired protein. DNA
containing the nucleotide sequence encoding the single-chain forms, including variants,
can be prepared from native sequences, or synthesized de novo or using combinations of
these techniques. Techniques for site-directed mutagenesis, ligation of additional
sequences, amplification such as by PCR, and construction of suitable expression systems
are all, by now, well known in the art. Portions or all of the DNA encoding the desired
protein can be constructed synthetically using standard solid phase techniques, preferably
to include restriction sites for ease of ligation. Suitable control elements for transcription
and translation of the included coding sequence can be provided to the DNA coding
sequences. As is well known, expression systems are now available compatible with a
wide variety of hosts, including procaryotic hosts such as E. coli or B. subtilis and
eucaryotic hosts such as yeast, other fungi such as Aspergillus and Neurospora, plant
cells, insect cells, mammalian cells such as CHO cells, avian cells, and the like.

The choice of host is particularly pertinent to posttranslational events, most
particularly including glycosylation. The location of glycosylation is mostly controlled
by the nature of the glycosylation sites within the molecule; however, the nature of the
sugars occupying this site is largely controlled by the nature of the host. Accordingly, a
fine-tuning of the properties of the hormones of the invention can be achieved by proper
choice of host.

A particularly preferred form of gene for the α subunit portion, whether the α
subunit is modified or unmodified, is the "minigene" construction. As used herein, the α
subunit "minigene" refers to the gene construction disclosed in Matzuk, M.M., et al, Mol
Endocrinol (1988) 2:95-100, in the description of the construction of pM²/CGα or pM²/α.

For recombinant production, modified host cells using expression systems are 
used and cultured to produce the desired protein. These terms are used herein as follows: 

A "modified" recombinant host cell, i.e., a cell "modified to contain" the
recombinant expression systems of the invention, refers to a host cell which has been
altered to contain this expression system by any convenient manner of introducing it,
including transfection, viral infection, and so forth. "Modified cells" refers to cells
containing this expression system whether the system is integrated into the chromosome
or is extrachromosomal. The "modified cells" may either be stable with respect to
inclusion of the expression system or the encoding sequence may be transiently
expressed. In short, recombinant host cells "modified" with the expression system of the
invention refers to cells which include this expression system as a result of their
manipulation to include it, when they natively do not, regardless of the manner of
effecting this incorporation.

"Expression system" refers to a DNA molecule which includes a coding 
nucleotide sequence to be expressed and those accompanying control sequences
necessary to effect the expression of the coding sequence. Typically, these controls
include a promoter, termination regulating sequences, and, in some cases, an operator or
other mechanism to regulate expression. The control sequences are those which are
designed to be functional in a particular target recombinant host cell and therefore the
host cell must be chosen so as to be compatible with the control sequences in the 
constructed expression system. 

If secretion of the protein produced is desired, additional nucleotide sequences
encoding a signal peptide are also included so as to produce the signal peptide operably
linked to the desired single-chain hormone to produce the preprotein. Upon secretion, the
signal peptide is cleaved to release the mature single-chain hormone.

As used herein "cells," "cell cultures," and "cell lines" are used interchangeably
without particular attention to nuances of meaning. Where the distinction between them
is important, it will be clear from the context. Where any can be meant, all are intended
to be included.

The protein produced may be recovered from the lysate of the cells if produced
intracellularly, or from the medium if secreted. Techniques for recovering recombinant
proteins from cell cultures are well understood in the art, and these proteins can be 
purified using known techniques such as chromatography, gel electrophoresis, selective 
precipitation, and the like. 

All or a portion of the hormones of the invention may be synthesized directly
using peptide synthesis techniques known in the art. Synthesized portions may be
ligated, and release sites for any drug contained in the linker moiety introduced by
standard chemical means. For those embodiments which contain amino acids which are
not encoded by the gene and those embodiments wherein the head-to-head or tail-to-tail
configuration is employed, of course, the synthesis must be at least partly at the protein
level. Head-to-head junctions at the natural N-termini or at positions proximal to the
natural N-termini may be effected through linkers which contain functional groups
reactive with amino groups, such as dicarboxylic acid derivatives. Tail-to-tail
configurations at the C-termini or positions proximal to the C-termini may be effected
through linkers which are diamines, diols, or combinations thereof.

Antibodies

The proteins of the invention may be used to generate antibodies specifically 
immunoreactive with these new compounds. These antibodies are useful in a variety of 
diagnostic and therapeutic applications. 

The antibodies are generally prepared using standard immunization protocols in
mammals such as rabbits, mice, sheep or rats, and the antibodies are titered as polyclonal
antisera to assure adequate immunization. The polyclonal antisera can then be harvested
as such for use in, for example, immunoassays. Antibody-secreting cells from the host,
such as spleen cells, or peripheral blood leukocytes, may be immortalized using known
techniques and screened for production of monoclonal antibodies immunospecific with
the proteins of the invention. "Antibodies" include any fragment which retains the
required immunospecificity, such as Fab, Fab', F(ab')2, Fv and so forth. Thus, the antibodies
may also be prepared using recombinant techniques, typically by isolating nucleotide
sequences encoding at least the variable regions of monoclonal antibodies with the
appropriate specificity and constructing appropriate expression systems. This approach
permits any desired modification such as production of Fv forms, chimeric forms,
"humanized" forms and the like.

By "immunospecific for the proteins of the invention" is meant antibodies which
specifically bind the referent compound of the invention, but not the heterodimers or any
of the included subunits per se or any single-chain forms which include only a single β
subunit, within the general parameters considered to determine affinity or nonaffinity. It
is understood that specificity is a relative term, and an arbitrary limit could be chosen,
such as a difference in specific binding of 100-fold or greater. Thus, an immunospecific
antibody included within the invention is at least 100 times more reactive with the
single-chain protein than with the corresponding heterodimers, prior art single-chain forms or
separate subunits. Such antibodies can be obtained, for example, by screening for those
that bind the invention compounds and discarding those that also bind the heterodimers,
subunits or prior art single-chain forms set forth in WO95/22340 and WO96/05224.
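
The screening criterion described above can be illustrated with a short Python sketch. It is only an illustration of the 100-fold limit given as an example in the text; the function names, the use of a single numeric binding measure, and the choice to compare against the strongest off-target signal are assumptions and are not taken from the specification.

```python
def fold_selectivity(target_binding, off_target_bindings):
    """Ratio of binding to the single-chain protein over the strongest
    binding seen with any comparison antigen (heterodimer, subunit, or
    prior art single-chain form). Units are arbitrary but must match."""
    strongest = max(off_target_bindings)
    if strongest <= 0:
        # No measurable off-target binding: treat selectivity as unbounded.
        return float("inf")
    return target_binding / strongest


def is_immunospecific(target_binding, off_target_bindings, min_fold=100.0):
    """Apply the illustrative 100-fold limit (min_fold is an assumed default)."""
    return fold_selectivity(target_binding, off_target_bindings) >= min_fold


if __name__ == "__main__":
    # Hypothetical signals for illustration only.
    print(is_immunospecific(500.0, [1.0, 2.0, 4.0]))
    print(is_immunospecific(500.0, [10.0]))
```

Comparing against the strongest off-target signal is the conservative choice here, since an antibody that cross-reacts with any one of the comparison antigens would be discarded in the screen described.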

Formulation and Methods of Use

The proteins of the invention are formulated and administered using methods
comparable to those known for the heterodimers corresponding to them. Thus,
formulation and administration methods will vary according to the particular hormone or
hormone combination used. However, the dosage level and frequency of administration
may be altered as compared to the heterodimer, especially if CTP units are present, in
view of the extended biological half-life due to their presence.

Formulations for proteins of the invention are those typical of protein or peptide
drugs such as found in Remington's Pharmaceutical Sciences, latest edition, Mack
Publishing Company, Easton, PA. Generally, proteins are administered by injection,
typically intravenous, intramuscular, subcutaneous, or intraperitoneal injection, or using
formulations for transmucosal or transdermal delivery. These formulations generally
include a detergent or penetrant such as bile salts, fusidic acids, and the like. These
formulations can be administered as aerosols or suppositories or, in the case of
transdermal administration, in the form of skin patches. Oral administration is also
possible provided the formulation protects the peptides of the invention from degradation
in the digestive system.

Optimization of dosage regimen and formulation is conducted as a routine matter
and as generally performed in the art. These formulations can also be modified to include
those suitable for veterinary use.

The compounds of the invention may be used in many ways, most evidently as
substitutes for the heterodimeric forms of the hormones. Thus, like the heterodimers, the
agonist forms of the single-chain hormones of the invention can be used in treatment of
infertility, as aids in in vitro fertilization techniques, and other therapeutic methods
associated with the native hormones. These techniques are applicable to humans as well
as to other animals. The choice of the single-chain protein in terms of its species
derivation will, of course, depend on the subject to which the method is applied. It will
be realized that the dual functionality which is conferred on those compounds which
contain two different β subunits confers opportunities for therapies that have previously
been unavailable.

The invention compounds are also useful as reagents in a manner similar to that 
employed with respect to the heterodimers. 

In addition, the compounds of the invention may be used as diagnostic tools to 
detect the presence or absence of antibodies that bind to the native proteins to the extent 
such antibodies bind to the relevant portions of these single chain compounds in 
biological samples. They are also useful as control reagents in assay kits for assessing the 
levels of these hormones in various samples. Protocols for assessing levels of the 
hormones themselves or of antibodies raised against them are standard immunoassay 
protocols commonly known in the art. Various competitive and direct assay methods can 
be used involving a variety of labeling techniques including radio-isotope labeling, 
fluorescence labeling, enzyme labeling and the like. 

The compounds of the invention are also useful in detecting and purifying 
receptors to which the native hormones bind. Thus, the compounds of the invention may 
be coupled to solid supports and used in affinity chromatographic preparation of receptors 
or antihormone antibodies. The resulting receptors are themselves useful in assessing 
hormone activity for candidate drugs in screening tests for therapeutic and reagent 
candidates. Of course, account must be taken of the dual specificity of the β subunits in
any of these compounds where the β subunits are different. However, where the two β
subunits are identical, they offer a powerful affinity purification tool for the relevant 
receptor. 

Finally, the antibodies uniquely reactive with the compounds of the invention can
be used as purification tools for isolation of these materials in their subsequent 
preparations. They can also be used to monitor levels of these compounds administered 
as drugs. 

The following examples are intended to illustrate but not to limit the invention.

Example 1 
Preparation of CGβ-α-CTP-FSHβ
A nucleotide sequence encoding the title compound was prepared using the
available nucleotide sequences for the relevant portions of the subunits. The CGβ region
encodes the 145 amino acids of human CGβ; the α subunit-encoding nucleotide sequence
encodes the 92 amino acids of human α as the minigene; the CTP-encoding sequence
encodes 28 amino acids representing positions 118-145 of human chorionic
gonadotropin; and the FSHβ-encoding region encodes the 111 amino acids of the human
FSHβ subunit.
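
As a quick check on the composition just described, the stated segment lengths can be summed to give the expected length of the mature single-chain product. The Python sketch below is illustrative only. It assumes that the four segments are fused directly, with no additional residues at the junctions, and that any signal peptide is excluded; the names SEGMENTS, span_length and total_length are hypothetical.

```python
# Segment lengths (amino acids) as stated in Example 1, N- to C-terminal order.
SEGMENTS = [
    ("CG-beta", 145),
    ("alpha", 92),
    ("CTP", 28),        # positions 118-145 of the hCG beta subunit
    ("FSH-beta", 111),
]


def span_length(first, last):
    """Number of residues in an inclusive range of positions."""
    return last - first + 1


def total_length(segments):
    """Sum of segment lengths, assuming direct head-to-tail fusion."""
    return sum(length for _, length in segments)


if __name__ == "__main__":
    print(span_length(118, 145))   # length of the CTP segment
    print(total_length(SEGMENTS))  # length of the fused mature protein
```

Under these assumptions the CTP segment spans 28 residues, consistent with the text, and the fused protein is 376 residues long.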

An amplified fragment containing CGβ exon 3, the α minigene, CTP and βFSH
was inserted into the SalI site of pM²HA-CGβexon1,2, an expression vector which is
derived from pM² and contains CGβ exons 1 and 2 in the manner described by Sachais,
J Biol Chem (1993) 268:2319. pM² containing CGβ exons 1 and 2 is described in
Matzuk, M.M., et al Proc Natl Acad Sci USA (1987) 84:6354-6358 and Matzuk, M.M. et al
J Cell Biol (1988) 106:1049-1059. First, a fragment containing the α minigene
downstream of CGβ exon 3 was inserted into this vector to obtain pM²-HACGβα.
pM²-HACGβα was then cleaved with ScaI and ligated with ScaI-restricted pBIIKS(+)α-CTP-FSH.
The resulting expression vector pM²-HACGβ-α-CTP-FSH produces the title
compound when inserted into a suitable host.

Example 2 

Production and Activity of CGβ-α-CTP-FSHβ
The expression vector constructed in Example 1 was transfected into Chinese 
hamster ovary (CHO) cells and production of the protein was assessed by 
immunoprecipitation of radiolabeled protein on SDS gels. 
The culture medium was collected, concentrated and tested for binding to the
human LH receptor (expected to bind the βCG-α portion).

For this assay, the LH receptor was prepared by inserting the cDNA encoding the 
entire human LH receptor into the expression vector pCMX (Oikawa, J. X-C et al Mol 
Endocrinol (1991) 5:759-768). Exponentially growing 293 cells were transfected with 
this vector using the method of Chen, C. et al Mol Cell Biol (1987) 7:2745-2752, 
resulting in expression of the LH receptor at the surface. 

In the assay, the cells expressing human LH receptor (2 x 10^5/tube) were
incubated with 1 ng of labeled hCG in competition with increasing concentrations of 
unlabeled hCG or increasing amounts of the sample to be tested at 22°C for 18 hours. The 
decrease in label in the presence of sample measures the binding ability in the sample. In 
this assay, with respect to the human LH receptor in 293 cells, the heterodimeric hCG had 
an activity typical of wild-type as previously determined and the
CGβ-α-CTP-FSHβ-containing medium also showed activity. These results are shown in Figure 1. As
shown, both heterodimeric hCG (solid squares) and the bifunctional single-chain protein
of the invention (solid circles) competed successfully with labeled hCG for LH receptor.
The bifunctional compound is less potent due to the modification of the α subunit
carboxy terminus.
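
To make the readout of such a competition assay concrete, the following Python sketch shows one common way of reducing displacement data to a percent-bound curve and a half-maximal dose. It is an assumption-laden illustration, not the analysis used to prepare Figure 1: the function names, the correction for nonspecific binding, the log-linear interpolation, and all numbers are hypothetical.

```python
import math


def percent_bound(counts, max_counts, nonspecific=0.0):
    """Specific binding of labeled hormone as a percentage of the
    binding observed with no unlabeled competitor present."""
    return 100.0 * (counts - nonspecific) / (max_counts - nonspecific)


def ic50(doses, percents):
    """Estimate the competitor dose giving 50% binding by interpolating
    linearly on a log10 dose axis between the two bracketing points.

    doses must be increasing and percents decreasing. Returns None if
    the curve never crosses 50%.
    """
    points = list(zip(doses, percents))
    for (d_lo, p_lo), (d_hi, p_hi) in zip(points, points[1:]):
        if p_lo >= 50.0 >= p_hi and p_lo != p_hi:
            frac = (p_lo - 50.0) / (p_lo - p_hi)
            log_d = math.log10(d_lo) + frac * (math.log10(d_hi) - math.log10(d_lo))
            return 10 ** log_d
    return None


if __name__ == "__main__":
    # Hypothetical values for illustration only; not data from Figure 1.
    doses = [1.0, 10.0, 100.0]           # amount of unlabeled competitor
    counts = [9000.0, 5500.0, 2000.0]    # label bound at each dose
    pct = [percent_bound(c, max_counts=10000.0, nonspecific=1000.0) for c in counts]
    print(pct, ic50(doses, pct))
```

In this framing, a less potent competitor shows up as a displacement curve shifted toward higher doses, that is, a larger half-maximal dose than that of the heterodimeric hormone.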

Also shown in Figure 1 are the results of the assay wherein varying amounts of a
culture supernatant derived from cells modified to contain two expression systems were
tested. One expression system produced a single-chain FSHβ-α; the other produced the β
subunit of hCG. The resulting noncovalently associated single-chain FSHβ-α/CGβ
complex (solid triangles) also successfully competed for binding.

In a similar manner, the supernatant from the culture medium containing
CGβ-α-CTP-FSHβ was tested for binding to the receptor for FSH, expressed in 293 cells. The
assay was conducted in the manner described above, except that cells expressing the 
human FSH receptor were substituted for those expressing human LH receptor and 
labeled FSH was used as the competitor. The results of this assay are shown in Figure 2. 
As shown, the single-chain title compound (solid circles) competed successfully
with FSH (solid squares) for binding. In an unrelated experiment, also shown in Figure
2, the mixture of a different type of complex, i.e., single-chain FSHβ-α noncovalently
associated with CGβ, which is mixed with uncomplexed excess single-chain FSHβ-α
(solid triangles), was an excellent competitor.

Example 3 

Construction of Additional Expression Vectors 
In a manner similar to that set forth in Example 1, expression vectors for the 
production of single-chain bifunctional FSHβ-CTP-α-CGβ, α-FSHβ-CTP-CGβ,
CGβ-βFSH-CTP-α, and βLH-CTP-βFSH-CTP-α are prepared and transfected into CHO
cells. The culture supernatants are collected and tested as described above with respect
both to the LH and FSH receptors. These compounds, too, show ability to bind both
receptors.

Claims



1. A glycosylated or nonglycosylated protein having agonist and/or
antagonist activity of the formula

β¹-(linker¹)ₘ-α-(linker²)ₙ-β² (1); or
β¹-(linker¹)ₘ-β²-(linker²)ₙ-α (2); or
α-(linker¹)ₘ-β¹-(linker²)ₙ-β² (3)

wherein each of β¹ and β² has the amino acid sequence of the β subunit of a
vertebrate glycoprotein hormone or a variant thereof;

"α" designates the α subunit of a vertebrate glycoprotein hormone or a variant
thereof;

"linker" refers to a covalently linked moiety that spaces the β¹ and β² subunits at
distances from the α subunit and from each other effective to retain said activity, and
each of m and n is independently 0 or 1.

2. The protein of claim 1 wherein said m and n are 1.

3. The protein of claim 1 wherein at least one said linker moiety includes a
drug to be targeted to the receptor for the glycoprotein hormone, or wherein at least one
linker is CTP or a variant thereof.

4. The protein of claim 1 wherein β¹ is the β subunit of FSH, LH or TSH
extended at a position proximal to its C-terminus by a complete or partial CTP unit or
variant thereof.

5. The protein of claim 1 wherein the α subunit or one or more β subunits or
both are modified by the insertion of a CTP unit or variant thereof into a noncritical
region thereof and/or wherein said linker moiety includes a CTP unit or variant thereof.

7. The protein of claim 1 wherein said variants contain 1-5 conservative 
amino acid substitutions as referred to the native forms or are truncated forms of said 
sequences or both. 

8. A pharmaceutical composition which comprises the protein of claim 1 in 
admixture with a suitable pharmaceutical excipient. 

9. The protein of claim 1 coupled to a solid support. 

10. Antibodies immunospecific for the protein of claim 1.

11. A DNA or RNA molecule which comprises a nucleotide sequence 
encoding the protein of claim 1.

12. An expression system for production of an agonist and/or antagonist of a 
glycoprotein hormone which expression system comprises a first nucleotide sequence 
encoding the protein of claim 1 operably linked to control sequences capable of effecting 
the expression of said first nucleotide sequence. 

13. The expression system of claim 12 which further contains a second 
nucleotide sequence encoding a signal peptide operably linked to the protein encoded by 
said first nucleotide sequence. 

14. A host cell modified to contain the expression system of claim 12. 

15. A host cell modified to contain the expression system of claim 13.

16. A method to produce a single-chain protein which is an agonist and/or 
antagonist of a glycoprotein hormone which method comprises culturing the cells of 
claim 14 under conditions wherein said protein is produced; and 

recovering said protein from the culture. 

17. A method to produce a single-chain protein which is an agonist and/or 
antagonist of a glycoprotein hormone which method comprises culturing the cells of 
claim 15 under conditions wherein said protein is produced; and 

recovering said protein from the culture. 

Abstract



Single-chain agonists and/or antagonists of the glycoprotein hormones are
disclosed. These proteins are of the formula

β¹-(linker¹)ₘ-α-(linker²)ₙ-β² (1); or
β¹-(linker¹)ₘ-β²-(linker²)ₙ-α (2); or
α-(linker¹)ₘ-β¹-(linker²)ₙ-β² (3)

wherein each of β¹ and β² has the amino acid sequence of the β subunit of a
vertebrate glycoprotein hormone or a variant thereof; "α" designates the α subunit of a
vertebrate glycoprotein hormone or a variant thereof; "linker" refers to a covalently
linked moiety that spaces the β¹ and β² subunits at distances from the α subunit and from
each other effective to retain said activity, and each of m and n is independently 0 or 1.